Techniques for Dealing with Missing Values
Abstract
A brief overview of the history of decision tree induction algorithms is followed by a review of techniques for dealing with missing attribute values in these methods. The technique of dynamic path generation is described in the context of tree-based classification methods. The waste of data that can result from casewise deletion of cases with missing values in statistical algorithms is discussed, and alternatives are proposed.
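As a rough illustration of the data waste attributed to casewise deletion, the sketch below uses a small, entirely hypothetical table in which `None` marks a missing value. The data and the helper names (`casewise_delete`, `observed_fraction`, `retained_fraction`) are assumptions made for this example and are not taken from the paper.

```python
# Hypothetical toy data: each row is a case, None marks a missing value.
rows = [
    [5.1, 3.5, None, 0.2],
    [4.9, None, 1.4, 0.2],
    [4.7, 3.2, 1.3, None],
    [None, 3.1, 1.5, 0.2],
    [5.0, 3.6, 1.4, 0.2],
]


def casewise_delete(data):
    """Keep only the cases (rows) that have no missing value."""
    return [row for row in data if all(v is not None for v in row)]


def observed_fraction(data):
    """Fraction of individual cells that hold an observed value."""
    cells = [v for row in data for v in row]
    return sum(v is not None for v in cells) / len(cells)


def retained_fraction(data):
    """Fraction of the observed values that survive casewise deletion."""
    observed = sum(v is not None for row in data for v in row)
    kept = sum(len(row) for row in casewise_delete(data))
    return kept / observed


complete = casewise_delete(rows)
print(len(complete), "of", len(rows), "cases kept")
print("observed cells:", observed_fraction(rows))
print("observed values retained:", retained_fraction(rows))
```

In this toy table only one value per incomplete case is missing, yet dropping those cases discards most of the values that were actually observed, which is the kind of loss the abstract refers to.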
Similar resources
Maximum likelihood analysis of the logistic regression model when predictor variable data are incomplete but auxiliary variables are available
Background and Objectives: Missing data exist in many studies, e.g. in regression models, and they decrease the model's efficacy. Many methods have been suggested for handling incomplete data, but they have generally focused on missing outcome values, although covariate values can also be missing. Materials and Methods: In this paper we study the imputation of missing values by the EM algorithm and auxiliary varia...
Combined association rules for dealing with missing values
With the rapid increase in the use of databases, the problem of missing values inevitably arises. The techniques developed to effectively recover these missing values should be highly precise in order to estimate the missing values completely. The mining of association rules can effectively establish the relationship among items in databases. Therefore, discovered association rules are usually ...
Dealing with missing data in a multi-question depression scale: a comparison of imputation methods
Background: Missing data present a challenge to many research projects. The problem is often pronounced in studies utilizing self-report scales, and literature addressing different strategies for dealing with missing data in such circumstances is scarce. The objective of this study was to compare six different imputation techniques for dealing with missing data in the Zung Self-reported Depressi...
Preparing the Data
Techniques for preprocessing data for data mining are discussed. Issues include scaling numerical data, attribute transformation, dealing with missing values, representation of time-dependent data, and outlier detection.
Frequency Ratio: a method for dealing with missing values within nearest neighbour search
In this paper we introduce the Frequency Ratio (FR) method for dealing with missing values within nearest neighbour search. We test the FR method on known medical datasets from the UCI machine learning repository. We compare the accuracy of the FR method with five commonly used methods (three “imputation” and two “bypassing” methods) for dealing with values that are “missing completely at rando...
Publication date: 1997